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Abstract 

For the Erdos-Renyi random graph G n ^ p , we give a precise asymptotic formula for the 
size &t {Gn,p) of a largest vertex subset in G n , P that induces a subgraph with average degree 
at most t, provided that p = p(n) is not too small and t = t(n) is not too large. In the case 
rS^ \ of fixed t and p, we find that this value is asymptotically almost surely concentrated on 

at most two explicitly given points. This generalises a result on the independence number 
of random graphs. For both the upper and lower bounds, we rely on large deviations 
inequalities for the binomial distribution. 

1 Introduction 

m 

Given a graph G = (V, E) and a non-negative number t, a vertex subset S C V is t-sparse if 
the subgraph G[S] induced by S has average degree at most t. The order of a largest such 
subset is called the t-sparsity number of G, denoted at(G). The t-sparsity number &t(G) is 
a natural generalisation of the independence number a{G). Recall that an independent set 
is a vertex subset of G with no edges, i.e. a 0-sparse set; thus the order a(G) of a largest 
independent set is just &o(G). Note that &t(G) is non-decreasing in terms of t. 

We investigate the asymptotic behaviour of at(G n , p ), where G ntP is a random graph with 
vertex set [n] = {1, . . . , n} and each edge is included independently at random with probability 
p. We focus on fairly dense random graphs: our main result holds when p = p{n) satisfies 
p > re _1 / 3 + e for some fixed e > and p bounded away from 1. We say that a property holds 
asymptotically almost surely (a.a.s.) if it occurs with probability that tends to 1 as n — > oo. 

For t = 0, that is, the independence number, the asymptotic behaviour in dense random 
graphs was described forty years ago by Matula [21 [151 02] , Grimmett and McDiarmid [9| , 
and Bollobas and Erdos [I]. For given < p < 1, define b = 1/(1 — p) and 

a p (n) = 2 log b n - 2 log fe \og b (np) + 2 log 6 (e/2) + 1. 

It was shown that for any 5 > a.a.s. \a p {n) — 5\ < a{G UtP ) < \a p {n) + <5J. The main 
objective of this paper is to provide an analogue of this for c\t{G ritP ). 

Some previous estimates on dt(G n ,p) are implicit in the work of two of the authors. In 
particular, for fixed p, it was observed using a first moment argument that for any e > 0, even 
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if t is a growing function of n, as long as t = o(ln(np)), we have at{G n ^ p ) < (2 + e) log b (np) 
a.a.s., cf. [121 Lemma 2.1]. It follows that at{G n)P ) and ct(G n>p ) share the same first-order term 
growth if t = o(ln(np)). Furthermore, if t = u(ln(np)), then (l—e)t/p < at{G n ^ p ) < (l+e)t/p, 
cf. |12t Lemma 2.2]. If t = 0(ln(np)), then the growth of the first-order term of OLt(G np ) is 
a multiple of log b (np), and large deviation techniques were used to determine the factor 
(which depends on p and t) [13]. (With the exception of the precise factor at the threshold 
t = 0(ln(np)), these statements have been shown to remain valid for smaller values of p as 
long as p » 1/n, cf. [HI Theorem 4.18].) 

In this work, we present a sharper description of att(G ntP ), using a finer application of the 
above-mentioned methods. However, we do not concern ourselves with the entire range of 
choices for the growth of t as a function of n, as above. To get our sharp formula with second- 
and third-order terms, p = p(n) must not tend to too quickly, and t = t(n) must not grow 
too quickly. For < p < 1, define b = 1/(1 — p) and 

a t)P (n) = 2 log 6 n + (t-2) log,, log b (np) - t log fe t + t log b (2bpe) + 2 log 6 (e/2) + 1. (1) 

Observe that &o lP (n) = a p (n) (under the convention that OlnO = 0) and also at, p (n) = 
a p (n) + tlog b ((2bpe/t) log b (np)). We prove the following. 

Theorem 1. Let < p = p(n) < 1 be such that p is bounded away from 1 and p > n _1 ' 3+e , 
for some positive e < 1/3. Suppose t = t(n) > and 5 = 5(n) > satisfy t = o(lnn/lnlnn) 
and £ 2 lnlnn/lnn = o(p5). Let &t, P (n) be as defined in (pQ). Then \_&t )P (n) — 5\ < c\t(G n>p ) < 
[&t, P (n) + 8\ a.a.s. 

We see then that at{G n)P ) is concentrated around &t tP (n) in an interval of width approximately 
£ 2 lnlnn/(plnn). Thus, if t 2 = o(p In n/ In In n) , then we have two-point concentration (or 
focusing), on explicit values. 

Let us mention another related generalisation of the independence number. Given a graph 
G = (V, E) and a non-negative integer t, a vertex subset S C V is t-dependent (or t-stable) 
if the subgraph G[S] induced by S has maximum degree at most t. The order of a largest 
such subset is called the t-dependence (or t-stability) number of G, denoted at(G). Easily, 
ott{G) < &t{G). In [8], we considered at(G n>p ), with our attention restricted to fixed p and 
fixed t, in order to apply analytic techniques to the generating function of degree sequences 
on k vertices and maximum degree at most t. For < p < 1, define 

a t , p (n) = 2 log fe n + (t - 2) log fe log 6 (np) + log^/t! 2 ) + t \og b {2bp/e) + 2 log 6 (e/2) + 1. 

We showed in [8] that for any fixed 5 > 0, [at )P (n) — 5\ < ctt(G n>p ) < L a *,p( n ) + ^J a.a.s. 
Note that in this setting the difference between the t-sparsity and the t-dependence numbers 
of G njP is essentially 6>t, P {n) — a>t tP (n) = 21og fo (t!e 4 /t l ). By Stirling's approximation for t\ 
(cf. [3J), we have that at tP (n) — at tP (n) ~ log fe (27rt) as t — > oo. 

We also comment here that, even if t is fixed, the property of t-sparsity is not hereditary, 
i.e. t-sparsity is not closed under vertex-deletion. Hence the general asymptotic results of 
Bollobas and Thomason [5] (developed in a long line of research that can be traced back to 
early results of Alekseev [1] , cf . also [2] ) , for partitions of random graphs according to a fixed 
hereditary property, are not applicable here. In our previous studies [3 [13] , it was useful that 
t-dependence is hereditary for fixed t. Unfortunately, this is not the case for t-sparsity. 

As will become apparent, challenges arise in the second moment computations. We have 
split this into several parts, according to the degree of overlap between two fc-subsets of [n]. 



Furthermore, in each part we must carefully account for the number of edges which are, say, 
within one of the fc-subsets but not the other, or strictly contained in the overlap, and so on. 
This careful accounting makes use of large deviations bounds for the binomial distribution. 

The term "sparse" may take on a number of different meanings in graph theoretic or al- 
gorithmic research. Instead of bounding average degree, one could instead bound for example 
the degeneracy (i.e. the maximum over all subgraphs of the minimum degree) or the maximum 
average degree. The counterparts of t-sparsity for these alternative versions of "sparse" are 
certainly of interest, but we do not pursue them here. We remark only that the counterpart 
for the former example is bounded below by at, while for the latter example it is necessarily 
bounded between at and at- 

It is worth noting that the algorithmic complexity of computing the t-sparsity of a graph 

- for the special cases of t fixed or t parameterised in terms of the order of the target set 

— was recently studied by Bourgeois et al. [B] and, perhaps unsurprisingly, NP-hardness was 
shown to hold even in the restricted case of bipartite graphs. 

Our paper is organised as follows. In Section [51 we outline the large deviations results 
that we employ. In Section [3j we perform first moment calculations to obtain Lemma [6j 
this lemma implies the upper bound in Theorem [TJ In Section [U we give a second moment 
calculation (Lemma [7|) which implies the lower bound in Theorem [TJ 

2 Large deviations 

In this section, we state the large deviations techniques used to precisely describe the average 
degree of a fc-set (a vertex subset of order k) in G n>p . For background into large deviations, 
consult Dembo and Zeitouni [TJ; we borrow some notation from this reference. Given < p < 
1, we let q = 1 — p throughout. Also, let 

. ± , . xln — h (1 — x) In for x E [0, 11 

A*( x ) = J p K ' q L J 

I oo otherwise 

(where A*(0) = ln(l/g) and A*(l) = ln(l/p)). This is the Fenchel-Legendre transform of 
the logarithmic moment generating function associated with the Bernoulli distribution with 
probability p (cf. Exercise 2.2.23(b) of [7]). Some easy calculus verifies that A*(x) has a global 
minimum of at x = p, is strictly decreasing on [0,p) and strictly increasing on (p, 1]. 

In the next lemma — a large deviations result for the binomial distribution — the upper 
bound follows easily from a strong version of Chernoff's bound, e.g. (2.4) in [TO], while the 
lower bound is implied by a sharp form of Stirling's formula, e.g. (1.4) of [3]: see the appendix 
of |13| for an explicit proof (when r is integral). 

Lemma 2. There is a constant 5 > such that the following holds. Let < p < 1, let N be 
a positive integer, and let X G Bin(A r ,p). Then, for each 1 < r < N — 1 such that r < Np, 



5 ■ max < r 



~ 1/2 , (N - r)- 1/2 \ ■ exp(-iVA*(r/AO) < P(A < r) < exp(-NA*(r/N)). 



Lemma [2] immediately yields the following estimate on the probability that a given set of 
size k is t-dependent. For a graph G, we let deg(G) denote the average degree of G. 

Lemma 3. Suppose < p = p(n) < 1 and suppose that t = t(n) > 1 and the positive integer 
k = k(n) satisfy that t < p(k — 1). Then 



(i) P(deg(O t4 ,) < t) < exp ( - ( „ ] A* ( 7 — r ) ) ; and 
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(ii) P(deg(G fc , p ) < t) > exp (- Q A* (--^j - \ In fc + 0(1)) 



For the second moment estimation, we will make use of the following asymptotic calcula- 
tions, the proofs of which are postponed to the appendix. 

Lemma 4. Suppose < p = p(n) < 1 and suppose the non-negative number t = t(n) and 
positive integer k = k(n) satisfy that t = o(p(k — 1)). For any e = e(n) with \e\ < 1, 

Lemma 5. Suppose < p = p(n) < 1 and that x = x(n) = o(p). Then 

A*(x) = In 6(l + o(l)). 

We remark that we will throughout make implicit use of the fact that, for < x < 1, 
—x/(l — x) < ln(l — x) < —x. 

3 An expectation calculation for the upper bound 

In this section, we consider the expected number of i-sparse /c-sets. Note that the range of 
valid values for p in the following lemma is not as restrictive as for Theorem [H and that the 
conditions for t and 5 are accordingly more general. 

Lemma 6. Let < p = p(n) < 1 be such that np —> oo as n — > oo and p is bounded 
away from 1. Suppose t = t(n) > and 5 = 6(n) > satisfy t = o(ln(np)/lnln(np)) and 
£ 2 log b ln(rep)/ln(np) = o(5). Let &t jP (n) be as defined in ([1]). Let k + = \at, p (n) + 5] and 
k~ = \pit,p{n) — b~\ and letS nt ^+ andS nt )~- be the collections oft-sparse k + -sets and k~ -sets, 
respectively. Then 

n\Sn,t,k- \) > exp ((1 + o(l))6 In(np)) and 
n\Sn,t,k+ 1) < exp (-(1 + o(l))S ln(np)) . 

Proof. Note that In 6 = (1 + o(l))p if p — > as n — > oo. For almost the entire proof, the 
calculations are carried out in terms of k, instead of k + or k~ . 
By Lemma O 



^,ul)=(;)exp(-Q)A*(^) +0(1 n*)) 



-feJ/LiV-m+o'" 



k ) v V 2 / \ k - l J V k 

( , i , , fk-l\ kif ( t \ „ /Yak 

exp 1 + In n - In k - A* + O 



k-l V k 



therefore, 

2toE(|^,, t |) =2 + 21nn _ 21n) ,_ (t _ 1)A ,/ t\ (M ] (2) 

Let us now expand one of the terms in §2§ using the formula for A* : 

= tlnt-t ]n(p(k - 1)) + (k - t - 1) In ( 1 - — — j + (k - t - 1) In b. 

Since \t/(k — 1)| < 1 for n large enough, we have by Taylor expansion that 

t \ t t 2 t 3 



]uii -—i) = -—i-w^w~w^w~'''' and 



giving that 

2\n.E(\S ntt ,k\) 



k 

ft 2 + lnfc" 
2 + 21nn-21nfc-thit + tlii(p(Jb-l))+t- (k - t- l)ln6 + 0( = ) . (3) 



Now, since t > 0, np — > oo and t < ln(np) for n large enough, it follows that k > 2 log b (np) 
2 log b ln(np) and 



In n — In k < In n — In (2 log fe (np) — 2 log 6 ln(np)) 
< In n — In ln(np) — ln(2/ In b) — In I 1 



In In (np)\ 
ln(np) J 

< In n - In ln(np) - ln(2/ In b) + O ( n / ^ ) 

\ In (np) / 

for n large enough. Furthermore, for n large enough, 

tln(p(k — 1)) < £ln(p(21og 6 (np) + 1 log 6 In (rip))) 

< t In ln(np) + t ln(2p/ In b) + i In ( 1 + n " -j^f ) 

< t In ln(np) + £ ln(2p/ In 6) + 



2 ln(np) / 

t 2 In In (np) 



ln(np) 
Similarly, for n large enough, 

(t In ln( nz? ) \ 
— i — 7 — \ — ) an d 
ln(np) J 



t\n(p(k- 1)) > tlnln(np) + t ln(2p/ In 6) + 



£ In In (np)\ 
ln(np) y 



so that 

(t In ln( nu ) \ 
— i — 1 — \ — ) an d (^) 

ln(np) / 

t ln(p(Jfe - 1)) = t In ln(np) + £ ln(2p/ In 6) + O ( " . n \ P ^ ] . (5) 

\ ln(np) / 

Until here, our calculations did not depend on using k + or k~ , but now we have 

(k~-t-l)hxb< 

21nn+(i-2)lnln(np) - (t - 2)lnlnb - tint + tln(2pe) + 21n(e/2) ±Slnb 
< (k + -t-l)lnb. 

Substituting the last inequalities together with ^ and ([5]) into ©, we obtain, for n large 
enough, 

JbK^-|) >0 /^n +0 /^ +<ht u 

/c \ m(np) / \ k ) 

since £ 2 lnln(np)/ln(np) = o(51n6) and A; > ln(np). Now, substituting the expression (1 + 
o(l))21og b (np) for k + or k~ completes the proof. □ 

For illustration, let us consider the case of p and t fixed. To satisfy the conditions in 
the above lemma we need 5 Inn/ In Inn —> oo as n — > oo. So we may, for instance, set 
5 = (In Inn) 2 / Inn. We find that the expected number of i-sparse sets of size k~ tends to 
infinity. The probability that there is a t-sparse set of size at least k + is at most E(|<S ra t k + 1) — > 
Oasn^ oo, and so &t(G nyP ) < [at, p (n) + 5\ a.a.s. 

4 Second moment calculations for the lower bound 

Lemma 7. Let < p = p(n) < 1 be such that p is bounded away from 1 and p > n _1 ' 3+e , 
for some positive e < 1/3. Suppose t = t(n) > and 5 = 5(n) > satisfy t = o(lnn/lnlnn) 
and t 2 In In nj In n = o(p5) . Let aj iP (n) be as defined in (pQ) . If k = k(n) = \at, P {n) — 5\ , then 

n& t (G njP ) < k) = (i). 

Proof Let S n j t k be the collection of t-sparse fc-sets in G nyP . By Lemma El 

n\Sn,t,k\) >exp((l + o(l))<51n(np)). (6) 

We use Janson's Inequality (Theorem 2.18(ii) in [10J: 

P(««(G„, P ) < t) = P(|5„,«| = 0) < ex P (-Jg^JL) , (7) 

where 

A= Y. P(A,S€5 ftit)Jfe ). 

A,BC[n],K\AnB\<k 
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We will split this sum into three sums according to the size of \A fl B\ which we denote by £. 
In particular, let p(k, £) be the probability that two fc-subsets of [n] that overlap on exactly £ 
vertices are both in S nt t t k- Thus, 



fc-i 



A = EC)(»)(rflKM). 



£=1 



k \£ 



For I E {1, . . . , k - 1}, let f(£) = (™) (*) { n k Z k £ )p(k, £)■ We set A x = ek/2 and A 2 = (1 - e)k. 
(In fact, we shall assume throughout our proof that e < 1/4; note that this assumption still 
implies the lemma.) Now we write A = Ai + A 2 + A3 where the parameters Ai and A 2 
determine the ranges of the three sums into which we decompose A: 

Ai = E /( £ )' A 2 = E /(*)» and A 3 = E /(*)■ 

1<£<Ai Ai<£<A 2 \ 2 <t<k 

We will show that for i G {1,2, 3} we have 

Ai = o(E\S nAk )). 

So then the result follows from (J7J). 

To bound Aj for each i G {1,2,3}, we consider two arbitrary fe-subsets A and B of [n] 
that overlap on exactly £ vertices, i.e. \A n -B| = ^, and estimate p(k,£) by conditioning on 
the set £[iflB] of edges induced by An B. In each of the three regimes, we need slightly 
different techniques to estimate p(k,£). 



Bounding Ai 

To bound Ai, we write 

P (k,£) = F(A,BeS n + k ) 



HA g S nittk I B g S nAk ) ■ P(B g 5 n , 



t,k) 



The property of having average degree at most t is monotone decreasing, so the conditional 
probability that A G S n: t t k is maximised when £[A fl B] =0. Thus 

F(^4 G S nAk I B G S nAk ) < P(A G S n>tjk I E[A n B] = 0) 



< P(A£M 6 (3pf Ae /y 



i.fej 



implying that p(M) < 6(2) P 2 (A G <Sn,t,fc)- 
We have though that for n large enough 

(9 (*:!) < 2 © -» fc -V(fc - 0! 



G) 



n k /kl 



n" 



Thus 



Ai < 



c (^4 G S^ 



t.fc. 





We set 



4 ip. 

TV 



Thus we write 



Ai < 2 ■ E 2 (S n , t<k ) Yl s t- 

2<£<Ai 

We will show that this sum is o(l). 

The following claim regards the monotonicity of {sg} for £ in the range of interest. 

Claim 8. If n is large enough, then for any 2 < £ < Ai we have S£ + i/s£ < 1/2. 
Proof. We have 

S£+i = (fc ~ £? ti_ < fc2 feAl = Q / n £ log 2 n 
s^ £ + 1 n — n \ np 2 

as & ' = 0(n e ). But as p > n _1 ' 2+£ , we have np 2 > n 2s and, therefore, S£+i/s£ < 1/2, for 
large enough n. D 

Thus the sum J2e<\ s £ ls essentially determined by its first term s 2 : 

^2 s e < 2s 2 - 
e<\i 

But we have 

u4\ /i 4 \ / i„„4 



^ \ ^i /l°g n \ /-> / 2 l°g n 



\n 2 J \ n 2 p 4 / \ (np) 4 J 

■\{p>n- l / 2+e . 

Bounding A 2 

The bound on A 2 = ^a <£<x f(P) involves a more thorough consideration of the number of 
edges in the overlap between the sets A and B. 

Let us fix some integer £ such that X\ < £ < A 2 . We will show that f (£)/¥? (\S nt t, k \) = 
o(l/k). With A, B being two sets of vertices, each having size k, that overlap on £ vertices, 
we have 

f{£) _ (£j) (J) ¥(A,BeS nAk ) 



^ 2 (\S n ,t,k\) " (2) p2 (^ e 5 n>t)fc ) • 
The first ratio on the right-hand side can be bounded for n sufficiently large as follows: 

(3M ^ofen-i n"-'/(k-e)\ ^ ok+1 (k\ a ^ o2k+1 fk 



fn\ 



n k /k\ ~ \£J n l ~ \n) 



We now give estimates on ¥(A,B £ S n ,t,k) as well as on ^(^4 € 5 n tfc)- F° r each set 
A of vertices, let -EL4] denote the set of edges with both their endvertices in A, and let 



e(A) = \E[A]\. Also, let e'(A,B) = e(A) - e(A n B) , the number of edges in E[A] \E[An B). 
Setting I = An B, we have 

F(A,B G Sn,t,k) < P(e(I) < kt/2) ■ ¥ 2 (e'{A,B) < kt/2). 

We will bound the two probabilities on the right-hand side of the above inequality using 
Lemma EJ As e(J) G Bin (Q,p\ and e'(A,B) G Bin (g) - Q,p), with Xl = kt/(£(£ - 1)) 
and xa,b = kt/(k(k — 1) — ^(^ — 1)) we have 



P(e(I) < fci/2) = exp I - 1 1 A* (xj) + O (In fc) 

P(e'(AB) < kt/2) = exp (- (Q - Q) A* (x A , B ) + O (Ink) 

Now, note that both xj and x^s are o(p). This holds since xj,xa,b = 0(t/k) and A; = 
©(In n/p) and £ = o(lnn/lnlnn). But now we can apply Lemma [5] to obtain 

P(e(I) < fct/2) • P 2 (e'(A,B) < fct/2) 

= exp(-Qln6(l + (l))-2(Q)-Q)ln6(l + (l)) + 0(lnfc)) (1Q) 

= exp (( J In 6 - 2( J ln&+ o(A; 2 p) 

Similarly, 

P(^4 G S n * fc ) = exp ( - f f ) In 6 + o(fc 2 p) ) . (II) 



,2 
Hence the estimates in (fTU|) and (fTT]) yield 

. -B G <5 n *fc) 



exp II In 6 + o(fc p) 



Now, combining the above together with ([U]) and the right-hand side of ([5]), we obtain 



exp I — £ In n + £ In fc + I I In b + o(fc p) 



E 2 (|<S„,t, fc |) "V V2 

/ / £ In 6 

= exp I — ^ I In n — In /c \- o(kp) 



(12) 



We will show that Inn — In k — £ In b/2 — > oo as n — > oo, for any Ai < £ < A 2 . Recall that 
A; = (2 + o(l)) logi,(np). Thus Inn — In k = In (nln &/ (2 In (np))) + o(l) > In (np) + O(lnlnn) 
as In 6 > p. Also, as ^ < (1 — e)/c, we have £ In 6/2 < (1 — e) ln(np)(l + o(l)). Therefore 

In n — In fe > e ln(np) + o(ln n) . 



These two bounds substituted into (|T2|) now imply that 

M 



^ 2 (\Sn,t,k\) 



exp(-n(£lnn)), (13) 



uniformly for all Ai < £ < A 2 - But since £ > ek/2, this bound is o(l/k) and therefore 

A 2 = o(E 2 (5 nAfc )). 



Bounding A3 

Next, to bound A3, the aim here is also to show that for I > A 2 we have 

'W -o(l 



E 2 (|5„,,, t |) V* 

This is the portion of A that is the most difficult to control. It is also the regime in which 
the condition p > ^ -1 / 3 + e j s required. (We only required the weaker condition p > n~ l ' 2+e 
to bound Ai and A2.) In this regime, we need to separately treat two sub-regimes which are 
divided according to the edge count in the overlap. 
Let us consider an arbitrary £ > A2 and write 

[tk/2] 
m=0 

where p(k,£,m) = F(A,B € S n j t k A e(A n B) = m). (Note that m < tk/2 or trivially both 
A,B^ S n tk-) We split this summation in two: 

A» L* fe / 2 J 

p(k,£) = ^p{k,l,m) + J^ p(k,£,m) =: Pl (k,£) + p 2 (k,£), (14) 

with \i = max{0, \tk/2 — {k — £){k + £ — l)V>J>/2j}, where ip is the unique < ip = ip{n) < 1 
such that A*(tpp) = (1 — £) In b, for some fixed < £ < 1 yet to be specified. 

That if) exists is guaranteed by the fact that A* is strictly decreasing on [0,p), A*(0) = In 6 
and A*(p) = 0. We now show that ip is bounded away from 0. We have that i/j satisfies 

_ l _ A*(V>p) _ 1 _ /_0P ln , !-#> ln l-ifrp 
In 6 ylno ln& q 

, # ^ 1 (1 -ipp)\n(l-tf;p) 

hi b if) In 6 

# 1 # 

< w + rr m t + rr 

In -^ mo 
since (1 — x) ln(l — x) > — x for < x < 1. Thus, using also p < ln6, 

£<V'P + V' m — +V>< ^(2 + In ip) 
tfi 

But x(2 + lnx) — > as x \ 0. Hence there exists 5 = <5(£) > such that ip > 5 uniformly 
over p. 

Let us give a bound on p\(k,£). We may assume that (fc — £)(&+•£ — l)ipp < tk, or else the 
sum is empty. It will suffice to consider E[A n B] alone. Observe that e(A n .B) is binomially 
distributed with parameters ( 2 ) and p. But £ > A2 = f2(ln n/p), and so since t = o(lnn) it 

follows that \x < tk/2 = o (p( 2 ) ) • Thus, by Lemma [51 



Pi (k, £) < P(e( A n S) < n) < exp f - Q A* ^ / ^ 



2y V^- 1 ) ^-1) 
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))■ 



By Lemma IH since tk = o(p£(£ - 1)) and < (k - £)(k + I - l)ipp < tk, 

n /a ^ ( (^(kJ tk \ ,, ,^dk-£)(k+t-l)lfo, p£(£-l) 

Pl (k,£) <exp - J A* — — - +(i + (i))i ') — ^ln F 



2) \ \£{£-l)J v v " £(£-!) tk 

--(-0) A -(^)-^^( ftzaJ 1 ±£za * , - !S 5 Ji ))- (15) 

To estimate p2(k,£), we need a finer argument in which we also consider the sets E[A] 
and -E[-B] of edges induced by A and B, respectively. In particular, let X\ and X2 denote 
e'(A,B) (recall that this is e(A) — e(A n B)) and e'(B,A), respectively. Note that X\ and 
X2 are binomially distributed with parameters £(k — £) + ( I ) = (A; — ^)(/c + £ — l)/2 and p. 
Furthermore, Xi and X2 and e(^4 D B) are independent. Therefore, 

P2(fc, £) < P(e(A HB)< tfc/2) • P 2 (Xi < tfc/2 - ft - 1). 
By Lemma O (since tk/2 = o (p( 2 ) ),) 



P(e(,4nB) < tfc/2) <exp( -( )A* 



and (as < if) < 1) 



?(A'i S //-V2 - // - I ) < F ( .Vi < (fc ^+^ ^ 

,_(*-0(* + <-l) (1 _ 0hit . 



We conclude that 

K(M) , exp (_Q A . (_*_) _ (^K|±^) (2 _ 201n6 ) . (16) 

Comparing with (fT5j) . since ifc = o(p£(£ — 1)) and V> = @(1)> we notice that pi(k,£) is 
asymptotically smaller than the above upper bound on p2(k,£). 
Now, from £ > A 2 it follows that 

£{£-l) > {k-£)(k + £-l). 

Indeed, (k — £)(k + £ — 1) < k 2 — £ 2 < k 2 — (1 — e) 2 k 2 < 2ek 2 and also for n sufficiently large 
£(£ - 1) > (1 - e) 2 k 2 > (1 - 2e)k 2 . As e < 1/4, the above inequality holds. 
Thus, since t = o{p{k — 1)), we obtain using Lemma[l]that 

AM » wiW(i+ ( *-° ( * +/ - 1) 



2) \£{£-l)J \2J \\ £{£-!) Jk-1 

2) \k-\) v \2) £(£-l)k t 

A- ( «A - (1 + „(!))» -M + '-l) J**/*) 



2) \k-lj x w/ 2 ^ pk/t 
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Furthermore, since A* is strictly decreasing on [0,p) and A*(0) = In 6, 

a*( —\ (( k ] (k-exk + e-i^ ^f t 



2) \k-lj \\2J 2 ) \k-\ 

k\ r ( t \ (k-£)(k + £-l) ^f I 



^.^yst^ti^J' 1 (18) 

Combining (|16 p - (|18p . we conclude that 

MM) <«p(-(a) A * (^y)- a + »(i)) (fc "^ )(fc 2 + ^" 1) (1 - 2 ^ lnfe ) • (19) 

As remarked earlier, pi(k, £) is asymptotically smaller than the upper bound for p2(k,£). 
Hence it suffices to show that 



W(\S n ^ k \) °\k 

Recall that with A being a set of vertices of size k we have 

n\s n ,t,k\) = (fj m e Sn,t, k ) = (fj exp (-Q) A * (^x) + °( lnfc ) 

where the last equality follows from Lemma El Thus, using (|19p . we have 

e 2 (|<W|) 



®(\Sn,t,k\) 

Now 



exp -(l + o(l))- ^— ^(l-2f)ln& + 0(lnfc) 



/A ^:^£(A-»,*-' 



(20) 



Thus using the lower bound on EdSn^l) given in ([6]) we obtain 

(*)(}) (E2) **(M) 



In- 



E 2 (|<WI) 



(Jfe - *)l*(lfen) - (1 + o(l))<Jln(np) - (1 + o(l)) ~ ^ fc + ^ (l - 2f) In 6 + O(lnfc). 
Now, we have for n sufficiently large 

(jfe - £) ln{kn) - (1 + o(l) y k ~ ^ + 0- ~ 2 ^ ln b 
<(k-£) (ln(nk) - ^"^ (1 - 2£) ln 6 
< (k - £){\n(nk) - (2 - 2e) ln(np)), 

12 



(21) 



where in the last inequality we used a choice of £ small enough as well as the fact that 
klnb = (1 + o(l))21n(np). But also k < (1 -+- o(l))21n(np)/p, as In b > p. This implies that 
In k < Inln(np) — lnp + 0(1). Hence, for n sufficiently large, 

(k - t) ln{kn) - (1 + (1))^— e lt±£L (i _ 2f) In 6 

< (fc-f)(-lnn + lnln(np) + 0(1) - 3 hip + 2eln(np)) 

< (A;-£)(-lnn-31nn" 1/3+e + 3eln(np)) < 0, 

where we used the condition p > n _1 ' 3+e i n the second last inequality. Substituting this into 
(|2"T|) . we obtain 

U) U) ^-^ P2(/c 'fl< eX p(_(i + (i)) 5 i n ( np ) + o(i rifc )). 



E 2 (|^ 



*,/.'! 



But In Inn/ Inn = o(p5) and therefore lnlnn/p = o(5ki(np)). On the other hand, Ink 
O (ln(lnn/p)), which implies that In A; = o(5\n(np)). Therefore 



(I)C)K)p2(M) n 







as required. D 
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A Appendix 

Proof of Lemma^ We split the proof into two cases. First, if e = —1, then 



k - 1 p(k-l) V k ~ 1 

t , t ( t \ , / t \ t 1 
In — + ( 1 - In 1 - - In ■ 



k-1 p(k-l) \ k-1 J V k-1 J k-1 q 
ln r^-^T-T— 7 + °(t2 =-(l + o(l)) r ln^- 



fc-1 p(k-l) k-1 \k 2 J "k t 

(l + o(l))fln^ 
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(where we used t = o(k) and the Taylor expansion of (1 — t/(k — l))ln(l — tj(k — 1))). 
Otherwise, — 1 < e < 1 and 

A , / (l + e)A = f (l + e)t \ n (l + e)t | / (1 + e)t \ ^ k - 1 - (1 + e)t 



k-1 J \ k-1 ) p{k-l) \ k-1 J q(k-l 

t , (l + e)t et , (l + e)i 
■ In \z -r + -, In ■ 



k-1 p(fc-l) fc-1 p(Jfe-l) 

£ \, fc-l-(l+e)t et , fc-l-(l+e)t 

+ 1 — In - : In ■ 



k-1) q(k-l) k-1 q(k-l) 

\ ( r^rl + r^T Hi + e)+(l- ^—) In (l 



k-1) k-1 v ' V k-1) V fc-l-t 

+ _f*_ln. ?(! + *)* 



fc-1 p(fc - 1 - (1 + e)t) 
and the lemma follows by observing that, by Taylor expansion, 



k-1 y ' \ k-1) \ k-l-t) \ k 

et g(l+e)t et pk 

ln Z7I — i /i , Zva =-( 1 + °( 1 ))t ln - 



k-1 p{k - 1 - (1 + e)t) K w/ k t 



Proof of Lemma\^ Since (1 — x) ln(l — x) = 0(x) as x — > 0, 



D 



A*( x ) = xln I - ] + (1 -x)lnb+ (1 - x) ln(l - x) 

_ / x /x\ r x 

\ In 6 \P J Vln6 

But p = 0(ln6) and x = o(p), and the lemma follows. □ 
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